CDS

Accession Number TCMCG075C21942
gbkey CDS
Protein Id XP_007021566.2
Location complement(join(5730819..5730905,5731143..5731240,5731515..5731629,5731708..5731755,5732027..5732164,5732925..5733022,5733563..5733670,5734021..5734118,5734250..5734320,5734409..5734570,5734968..5735039,5735154..5735220,5736410..5736513,5736880..5736942,5737307..5737415,5737718..5737854))
Gene LOC18594054
GeneID 18594054
Organism Theobroma cacao

Protein

Length 524aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007021504.2
Definition PREDICTED: 26S proteasome non-ATPase regulatory subunit 5 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category O
Description 26S proteasome non-ATPase regulatory subunit
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko03051        [VIEW IN KEGG]
KEGG_ko ko:K06692        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0000502        [VIEW IN EMBL-EBI]
GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005838        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0008540        [VIEW IN EMBL-EBI]
GO:0009987        [VIEW IN EMBL-EBI]
GO:0016043        [VIEW IN EMBL-EBI]
GO:0022607        [VIEW IN EMBL-EBI]
GO:0022624        [VIEW IN EMBL-EBI]
GO:0032991        [VIEW IN EMBL-EBI]
GO:0034622        [VIEW IN EMBL-EBI]
GO:0043248        [VIEW IN EMBL-EBI]
GO:0043933        [VIEW IN EMBL-EBI]
GO:0044085        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0065003        [VIEW IN EMBL-EBI]
GO:0070682        [VIEW IN EMBL-EBI]
GO:0071840        [VIEW IN EMBL-EBI]
GO:1902494        [VIEW IN EMBL-EBI]
GO:1905368        [VIEW IN EMBL-EBI]
GO:1905369        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGATGAAGAGTTCTCTGTAGACGATCCGACTCACTTGCTGGACTCAGCTTCTGAGTTCGCTCATCACCCTGGTGTTCAAAACGACGCCACCACAAAAGATTTTCTTGACCGTTTCCCTCTCCCTGTCATCATCAGTGCTTTGCAAACCAAAGCTGATGTGCCTGGCCTGGAAAATACTCTGGCTGATTGTTTGGAAAGGGTTTTTAAGACCAAGTATGGTGGCTCACTCATTCCACAATACATGCCGTTCCTTCAAGTTGGCCTTAAGGCAGATTCTCAAATGGTTCGTTGTTTGGCATGCAAAACGGTTTCATCCTTTTTAGAGAACTTTGATGACAAATCTATATCTGCCATACAGCTGATTATTGACTATGATCTATATCCACTCTTACTGGACTGCCTAATTTATGGCAATGAACAAGTTGCAACTGCTGCAATTGATGCAATCAAGAACTTAGCTCGGTTTCCTGAAGGCATGAGCATCATCTTTCCAGCTAACATAAATGAAGTTGCGCATCTTGGGAACTTAGCATCACGATGTTCATCATTGGGACGTGCACGGGTTTTATCATTGATAGTGAAGTTATTCTCTATTTCCAGCTCTGTAGCTTCAGTAATATACAATTCAAATTTGCTCAGTTTATTGGAGGCAGAAATCAGGAATTCAAATGATACCCTTGTAACCCTAAGTTCTTTGGAGCTCTTGTATGAGTTGACTGAGATCCAGCATGGTACAGAGTTCTTGTCTAGGACCACCCTTCAATTACTTCATTCTATAATCAGCAACTCATCAATGGAAGGAATTCTAAGATCAAGAGCAATGATGATAAGTGGAAGGCTTTTATCCAAGGAGAACATATACATGTTTGTTGATGAACTGAGTGCCAAAGGTGTGATTTCAGCCATTGATGTGAGACTTGGCCTATTGGACAGTCAAGATAAAGATGAATGTGAATCTGCACTTGAAGCCCTTGGACAAATAGGATCGTCAATCCAAGGAGCTGTGTTACTACTGTCAAGTTTTCCGCCTGCTGCAAGGCATATAGTTCATGCTGCATTTGATCGGCAAGGACGTTGTAAACAGCTGGCTGCATTACATGCATTGGCGAATGTCACGGGAGAAAACCGGCCTGAGGATAGTGTTATCTTAAGCGGTGATGCAGAAGAAAGCCTTCGACGTTTGATCTATGAAGTAGCATCAGAAAGTTCAAAGCTGACACCATCTGGTCTTTTCCTATCAGTTCTTCAACAGGCTGCAGAATTTCGTCTGGCAGGGCACAGAGTGATAACAGGGTTAGTAGCTCGAGCTTGGTGCCTGATGGAGATTTGCTCAAAACAGGAGATAATAAACATGGTGACTGATCCAGCTACTGAAACTACAAAAATAGGTATGGAGGCTAGATATAAATGTTGCAAGGCAATCCACAGAGCATTCATGTCAAGTAAACTTGTTAGCGACCCTGCGCTTTCTGGTATAGCTGGGAAGTTGCAAGAAGCTGTTCAAAGAGGTCCATATCTGACAAGAAAACATACTGAAGCAGCTCCGGTAGTAATGACAGCTGAAAGATTTTAA
Protein:  
MDEEFSVDDPTHLLDSASEFAHHPGVQNDATTKDFLDRFPLPVIISALQTKADVPGLENTLADCLERVFKTKYGGSLIPQYMPFLQVGLKADSQMVRCLACKTVSSFLENFDDKSISAIQLIIDYDLYPLLLDCLIYGNEQVATAAIDAIKNLARFPEGMSIIFPANINEVAHLGNLASRCSSLGRARVLSLIVKLFSISSSVASVIYNSNLLSLLEAEIRNSNDTLVTLSSLELLYELTEIQHGTEFLSRTTLQLLHSIISNSSMEGILRSRAMMISGRLLSKENIYMFVDELSAKGVISAIDVRLGLLDSQDKDECESALEALGQIGSSIQGAVLLLSSFPPAARHIVHAAFDRQGRCKQLAALHALANVTGENRPEDSVILSGDAEESLRRLIYEVASESSKLTPSGLFLSVLQQAAEFRLAGHRVITGLVARAWCLMEICSKQEIINMVTDPATETTKIGMEARYKCCKAIHRAFMSSKLVSDPALSGIAGKLQEAVQRGPYLTRKHTEAAPVVMTAERF